On the Furthest Hyperplane Problem and Maximal Margin Clustering
نویسندگان
چکیده
This paper introduces the Furthest Hyperplane Problem (FHP), which is an unsupervised counterpart of Support Vector Machines. Given a set of n points in R, the objective is to produce the hyperplane (passing through the origin) which maximizes the separation margin, that is, the minimal distance between the hyperplane and any input point. To the best of our knowledge, this is the first paper achieving provable results regarding FHP. We provide both lower and upper bounds to this NP-hard problem. First, we give a simple randomized algorithm whose running time is n ) where θ is the optimal separation margin. We show that its exponential dependency on 1/θ is tight, up to sub-polynomial factors, assuming SAT cannot be solved in sub-exponential time. Next, we give an efficient approximation algorithm. For any α ∈ [0, 1], the algorithm produces a hyperplane whose distance from at least 1 − 5α fraction of the points is at least α times the optimal separation margin. Finally, we show that FHP does not admit a PTAS by presenting a gap preserving reduction from a particular version of the PCP theorem. ∗Yahoo! Research [email protected]. †Yahoo! Research, [email protected]. ‡IAS, [email protected]. Supported by DMS-0835373. §Technion Institute of Technology and Yahoo! Research [email protected]. ¶Princeton University and Yahoo! Research [email protected].
منابع مشابه
Unsupervised SVMs: On the Complexity of the Furthest Hyperplane Problem
This paper introduces the Furthest Hyperplane Problem (FHP), which is an unsupervised counterpart of Support Vector Machines. Given a set of n points in R, the objective is to produce the hyperplane (passing through the origin) which maximizes the separation margin, that is, the minimal distance between the hyperplane and any input point. To the best of our knowledge, this is the first paper ac...
متن کاملBayes Optimal Hyperplanes! Maximal Margin Hyperplanes
Maximal margin classifiers are a core technology in modern machine learning. They have strong theoretical justifications and have shown empirical successes. We provide an alternative justification for maximal margin hyperplane classifiers by relating them to Bayes optimal classifiers that use Parzen windows estimations with Gaussian kernels. For any value of the smoothing parameter (the width o...
متن کاملIndoor WLAN positioning Using Hybrid SVM Hyperplane Margin Clustering and Regression
This paper proposes a novel indoor Wireless Local Area Network (WLAN) positioning algorithm by using the Support Vector Machine (SVM) Hyperplane Margin Clustering and Regression (SVMCR). First of all, we rely on the SVM Hyperplane Margin Clustering (SVMC) to reduce the search space of the fingerprint database. Second, we use the Support Vector Regression (SVR) to characterize the relations of t...
متن کاملEfficient Maximum Margin Clustering via Cutting Plane Algorithm
Maximum margin clustering (MMC) is a recently proposed clustering method, which extends the theory of support vector machine to the unsupervised scenario and aims at finding the maximum margin hyperplane which separates the data from different classes. Traditionally, MMC is formulated as a non-convex integer programming problem and is thus difficult to solve. Several methods have been proposed ...
متن کاملMinimum Spectral Connectivity Projection Pursuit for Unsupervised Classification
We study the problem of determining the optimal univariate subspace for maximising the separability of a binary partition of unlabeled data, as measured by spectral graph theory. This is achieved by finding projections which minimise the second eigenvalue of the Laplacian matrices of the projected data, which corresponds to a non-convex, non-smooth optimisation problem. We show that the optimal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1107.1358 شماره
صفحات -
تاریخ انتشار 2011